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Abstract 

This paper presents the results of statistical analysis of a set of physico-chemical water quality 
parameters, monthly collected from December 2012 to November 2013 at the seven different sampling 
stations spread over the Shatt Al-Arab River. Seventeen parameters were treated using Multivariate 
statistical technique, such as principal component analysis (PCA) and cluster analysis (CA) were applied 
for the evaluation and interpretation of a water quality data set for the Shatt Al-Arab River.The results of 
PCA identified four latent factors, which are responsible for the data structure explaining 78.64% of the 
total variance of the dataset these factors are: Water mineralization. Seasonal effect of temperature and 
organic pollution. Nutrients content and water visibility. CA showed four different groups of similarity 
between the sampling stations reflecting the different physicochemical characteristics features and natural 
background sources types. This study suggests that PCA and CA techniques are useful tools for 
identification of important surface water quality monitoring stations and parameters. 
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Introduction 

Shatt Al-Arab River is one of the most important interior rivers in Iraq, because of its economic 
and social values. It is the main source of surface water in Basrah governorate. Water of Shatt Al-Arab has 
been used for various purposes including potable water supply, irrigation, fisheries, navigation, and 
industrial uses [1]. Moreover, Shatt Al-Arab River is the conjunction link of freshwater from Iraq into the 
Arabian Gulf. This river has been subjected to several pollution activities, mostly from domestic waste 
discharge, industrial waste and agricultural and as put this waste directly into the river without any handle 
them and this waste be loaded mostly pollutants may pose serious health and can adversely affect the use 
beneficial to the river [2]. Because rivers constitute the main inland water resource for domestic, industrial 
and irrigation purposes, it is imperative to prevent and control river pollution and to have rehable 



Www.bumej.com 



39 



Mesopotamia Environmental Journal (MEJ) 

2014, Vol.1, No.l:36-46. 

information on the quality of river water. In view of the spatial and temporal variations in the 
hydrochemistry of rivers, regular monitoring programmes are required for reliable estimates of the water 
quality [3]. The quality of water is identified by its physical, chemical and biological properties. The 
particulate problem in case of water quality monitoring is the complexity associated with analysis a large 
number of measured variables [4]. The data sets contain rich information about the behavior of the water 
body. The classification, modeling and interpretations of monitoring data are the most important steps in 
the assessment of water quality. The application of different multivariate approaches, such as principal 
component analysis (PCA) and cluster analysis (CA) for the interpretation of these complex data matrices 
offers a better understanding of water quality and ecological status of the studied systems. It also allows the 
identification of possible factores/sources that influence water system and offer a valuable tool for the 
reliable management of water resources as well as rapid solutions to pollution problems [5, 6]. In the 
present paper, the data matrix obtained during one year monitoring program at seven different stations 
spread over the Shatt Al-Arab River, subjected to different multivariate statistical approaches in order to 
extract information about the similarities or dissimilarities among monitoring stations, and to determine the 
influence of possible sources (natural and anthropogenic) on the physico-chemical variables of the Shatt 
Al- Arab River. 

Materials and Methods 
Study area and sampling sites 

The confluence of the Tigris and Euphrates rivers at the town of Qurna, north of Basra city forms 
the Shatt Al- Arab River, which flows to the south west to the Arabian Gulf. The Shatt Al-Arab River has a 
length of 200 km, a width range between 400 m at Basra and up to more than 2 km at the estuary and a 
depth of between 8-15 m, considering tides [7]. The hydrological condition of the Shatt Al-Arab River basin 
is affected by several factors including conditions at the upper reaches of the Tigris and Euphrates rivers, 
the status of advancing flood tides from the Arabian Gulf, seepage of saline ground water into the basin, as 
well as the impact of climate conditions prevailing in the region on discharge rates and the payload of the 
river [8]. The discharge of moderately saline water from the East Hammar marsh and limited precipitation 
exacerbates water quahty and the hydrology of the region [9]. Seven sampling stations were established 
along the river course in order to obtain a good knowledge to detect spatial similarity for grouping 
sampling stations located within the monitoring area, and determine thefactors or sources responsible for 
water quality variations in quality of the river (Table 1, Fig. 1). 
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Table 1. Sampling locations and coordinates of sampling stations at Shatt Al-Arab River 

Stadon nmniba: Station name Langitude Latitude 

St l Garma 47M5 32. 39 30-^37 29 85 

SL2 Maqal 47" 47 15.01 30-^33 59J2 

a 3 JL^aj 47*" 51 29.10 30'= 30 12.43 

SL4 MoMa 47*^55 34.73 30^28.5.53 

SL5 Ahuilouse 4*M1 16.94 30^27 37.77 

SL6 Stba 4«M 636.48 30'= 19 52.41 

SL7 fH 48*" 27 52.75 29'= 59 23.73 




Figure. l.Map of Shatt Al-Arab River showing the seven sampling stations 



Field Sampling and procedures 

Monthly water samples were collected from selected seven stations in Shatt Al-Arab River from December 
2012 to November 2013. Water samples collected for analysis of nutrient, major ions were filtered using 
Millipore filter paper (0.45 |im) and preservedand analyzed according to the standard methods described in 
[10]. Water physical and chemical parameters including water temperature (WT), electrical conductivity 
(EC), salinityand pH were measured insitu usingthe WTW Multi-meter model 4430. Turbidity was 
dQiQxmmQddnsitu by WTWturbidity-meter type Terb 550. Dissolved oxygen (DO) andBiochemical oxygen 
demand (BOD 5) were determined by the Winkler azide method. Alkalinity was determined by acidtitration 
using methyl-orange as endpoint. Calcium (Ca-F2) and magnesium (Mg-F2) concentrations were measured 
by ethylene- ediaminetetraacetic acid (EDTA) complexometricy titration. Sodium (Na-F) and potassium 
(K-i-)concentrations were measured by flame photometry and chloride (C1-) concentration by silver nitrate 
(AgN03) titrationusing potassium chromate (K2Cr04) solution as an indicator. Sulphate (S04"^) 
concentration was determined specrophotometrically by barium sulphate turbidity method. Nitrite nitrogen 
(N02-N) was determined according to colorimetric method. Nitrate nitrogen (N03-N) and orthophosphate 
phosphorus (P04-P) concentrations were measured by ultraviolet (UV) and molybdate ascorbic acid 
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methods, respectively. Statistical computations were executed using the statistical software package, XL 
STAT pro v.4.In this study, each water quality parameter was standardized (z-scale) before PC A and CA 
analysis was performed in order to minimize the influence of different variables and their respective units 
of measurements [11]. 

Results and Discussion 

Principal component analysis 

It is difficult to draw a clear conclusion directly because the complexity of relationships between 
parameters of water quality, conclusion directly. However, PCA can explain the structure of the data in 
detail by extracting the latent information [11]. PCA is designed to transform the original variables 
(parameters in this study) in to new, uncorrelated variables (axes), called the principal components (PCs), 
which are linear combinations of the original variables [12]. In this study,the scree plot was used to identify 
the number of PCs to be retained in order to comprehend the underlying data structure [3, 5]. According to 
eigenvalue-onecriterion[13], only four PCs (with eigenvalue >1) were considered which explaining 78.64% 
of the total variance in the water quality datasets, the rest PCs were eliminated (Table 2; Fig. 3, 4). In order 
to make easier interpretation for the PCAresults, the first four PCs were rotated according to varimax 
methodwith Kaiser Normalization[14]. Results of the PCs from the Varimax rotations are displayed in 
Table 2 and Fig. 3, 4. The first rotated component (PCI) which explains 32.80% of total variance in the 
datasets and hasstrong positive loadingon EC, sahnity, C1-, Mg+2, K+, S04 -2 and Na+, respectively, 
whereas, it has moderate positive loading on Ca+2 (Table 2) (Table 2; Fig. 3). This component can be 
related to mineral component of the river water. The parameters on this component indicatefluctuationand 
reduction in water flow and increase in soluble saltsfrom Arabian Gulf [2, 15]. The PC2 explained 21.41% 
of total variance in datasets and consists of a strong positive loading of WT, alkalinity and BODS, while 
strong and moderate negative loading with DO and pH respectively (Table 2). This component was mainly 
associated with the seasonal effect of temperatureand anthropogenic pollution sources and can be explained 
that high levels of dissolved organic matters consume large amounts of oxygen, which results in formation 
of ammonia, C02 and organic acids leading to a decrease in water pH values (Volga et al., 1998; Shrestha 
and Kazama, 2007). The PC3explained 14.85% of the total variance and was positively and largely 
contributed by inorganic nutrients-related parameters (N03-N, N02-N andP04-P) (Table 2; Fig. 4). This 
component distinguishes the importance of anthropogenic inputs mainly from non-point pollution 
sources(such as agricultural runoff and atmospheric deposition) [6], and point pollution sources (such as 
domestic and industrial wastewater effluents) [16]. According to [17], high levels of both dissolved 
inorganic nitrogen and phosphorusin Shatt Al-Arab water ,resulted from both point and non-point pollution 
sources. PC4explains 9.57% of total variance has strong positive loading on turbidity. This component can 
be attributed to the water visibility.In summary, four extracted factors (PCs) representing four different 
processes responsible for water quality variations in Shatt Al-Arab River are: (1) Water mineralization, (2) 
Seasonal effect of temperature and organic pollution, (3) Nutrients content, and (4) water visibility. 
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Figure 2. Scree plot of the eigenvalues. 
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Table 2. The principal components After Varimax Rotation 
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Figure 3. Component loadings for the (a) with PCI and PC2, and (b) with PC2 and PCS after varimax 
rotation 
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Spatial similarity and sample station grouping using cluster analysis 

The purpose of cluster analysis is to identify groups or clusters of similar stations on the basis of 
similarities within a class and dissimilarities between different classes [18]. Hierarchical agglomerative 
clustering is the most commonapproach, which provides instinctive similarity relationships between any 
one sample and the entire dataset, is typically illustrated by a dendrogram (tree diagram) [19]. The 
dendrogram presents a picture of the groups and their proximity to one another, with a dramatic reduction 
in the dimensionality of the original data [20]. To detect spatial similarity among groups, CA was applied 
to the seven sampling stations. Hierarchical agglomerative CA was performed on the normahzed data set 
by means of the Ward's method, using Euclidean distances as a measure of similarity. The clustering 
procedure generated four groups of stations in a convincingway, indicating relatively high indepen-dency 
for each group (Fig. 4). In the other word, the stations in these groups have similar characteristic features 
and natural background sources types. Cluster 1 (XI) consisted St. land St.2, which were located at the 
upper reach of the river, were population density is considerably moderate and it's characterized bya large 
number of orchards and farmland especially on the left bank.The common feature of these stations was 
well oxygenated and lower soluble salts content comparedtothe other stations (Fig. 5). Cluster 2 (X2) 
consisted St. 3 and St. 4,which were located at the middle part of the Shatt Al-Arab River, where impacted 
by heavily populated areas of Basra City and it is likely that high values of BOD5 and nutrient concentrations, as 
well as, depletion in pH and DO concentration in these stations indicate water pollution resulting from, sewage 
effluents discharged from Basrah city, as well as industrial and agricultural activities (Fig. 5). The cluster 3 (X3), 
including St. 5 and St. 6 which located at the river downstream.The common feature of these stations 
wasrelatively higher turbidity, Na, CI and S04Compared to other stations in clusters 1 and 2 (Fig. 5). The 
cluster 4 (X4), consisted St.7, which were located close the mouth of the Arabian Gulf, and it is 
negativelyaffected by the high level of salty water intrusion from the Arabian Gulf (Fig. 5). The common 
feature of this station was high sahnity, EC, major ions, alkalinity concentrations and turbidity, whereas, it 
was low nutrient content compared to the other stations (Fig. 5). 

In summary, hierarchical CA helped to group the seven samphng stations into four clusters of 
similar characteristics reflecting the different water quahty characteristics and pollution (natural and 
anthropogenic) sources. It is evident that the CA technique is useful in offering reliable classification of 
surface waters throughout a whole region, and will make it possible to accurately perform spatial 
assessment in an optimal manner. Thus, it could be employed to design a future spatial sampling strategy 
in an optimal manner by reducing the number of sampling stations without losing any significance of the 
outcome. 
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Figure 4.Dendrogram obtained by agglomerative hierarchal clustering analysis for sampling stations 
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Figure 5. 56>x /?/6>^>y represent spatial variation of the water quality parameters between clusters in Shatt 
Al- Arab River 
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